Abstract. By considering a least squares approximation of a given square
integrable function $f\colon [0,1]^n \to \mathbb{R}$ by a multilinear polynomial
of a specified degree, we define an index which measures the overall interaction
among variables of $f$. This definition extends the concept of Banzhaf
interaction index introduced in cooperative game theory. Our approach is partly
inspired by multilinear regression analysis, where interactions among the
independent variables are taken into consideration. We show that this
interaction index has appealing properties which naturally generalize several
properties of the Banzhaf interaction index. In particular, we interpret this
index as an expected value of the difference quotients of $f$ or, under certain
natural conditions on $f$, as an expected value of the derivatives of $f$.
Finally, we discuss a few applications of the interaction index in aggregation
function theory.



1. Introduction 

Sophisticated mathematical models are extensively used in a variety of areas 
of mathematics and physics, and especially in applied fields such as engineering, 
life sciences, economics, finance, and many others. Here we consider the simple 
situation where the model aims at explaining a single dependent variable, call it y, 
in terms of $n$ independent variables $x_1, \ldots, x_n$. Such a model is usually
described through an equation of the form

    $y = f(x_1, \ldots, x_n),$

where $f$ is a real function of $n$ variables.

Now, suppose that the function $f$ describing the model is given and that we want
to investigate its behavior through simple terms. For instance, suppose we want
to measure the overall contribution (importance or influence) of each independent
variable to the model. A natural approach to this problem consists in defining the
overall importance of each variable as the coefficient of this variable in the least
squares linear approximation of $f$. This approach was considered by Hammer and
Holzman [14] for pseudo-Boolean functions and cooperative games
$f\colon \{0,1\}^n \to \mathbb{R}$. Interestingly enough, they observed that the
coefficient of each variable in the linear approximation is exactly the Banzhaf
power index [3, 7] of the corresponding player in the game $f$.

In many practical situations, the information provided by the overall importance
degree of each variable may be far from sufficient due to the possible interactions
among the variables. Then, a more flexible approach to investigate the behavior of
$f$ consists in measuring an overall importance degree for each combination
(subset) of variables. Such a concept was first introduced in [16] for Boolean
functions $f\colon \{0,1\}^n \to \{0,1\}$ (see also [5, 6]), then in [17] for
pseudo-Boolean functions and games $f\colon \{0,1\}^n \to \mathbb{R}$ (see also
[13]), and in [9] for square integrable functions
$f\colon [0,1]^n \to \mathbb{R}$.

In addition to these importance indexes, we can also measure directly the
interaction degree among the variables by defining an overall interaction index for
each combination of variables. This concept was introduced axiomatically in [13]
(see also [8]) for games $f\colon \{0,1\}^n \to \mathbb{R}$. However, it has not
yet been extended to real functions defined on $[0,1]^n$, even though such
functions are of growing importance, for instance in aggregation function theory.
In this paper we intend to fill this gap by defining and investigating an
appropriate index to measure the interaction degree among variables of a given
square integrable function $f\colon [0,1]^n \to \mathbb{R}$.

Our sources of inspiration to define such an index are actually threefold: 

In cooperative game theory: Interaction indexes were introduced axiomatically
    a decade ago [13] for games $f\colon \{0,1\}^n \to \mathbb{R}$ (see also [8]).
    The best known interaction indexes are the Banzhaf and Shapley interaction
    indexes, which extend the Banzhaf and Shapley power indexes. Following
    Hammer and Holzman's approach [14], it was shown in [11] that the Banzhaf
    interaction index can be obtained from least squares approximations of the
    game under consideration by games whose multilinear representations are
    of lower degrees.

In analysis: Considering a sufficiently differentiable real function $f$ of
    several variables, the local interaction among certain variables at a given
    point $a$ can be obtained through the coefficients of the Taylor expansion
    of $f$ at $a$, that is, through the coefficients of the local polynomial
    approximation of $f$ at $a$. By contrast, if we want to define an overall
    interaction index, we naturally have to consider a global approximation of
    $f$ by a polynomial function.

In statistics: Multilinear statistical models have been proposed to take into
    account the interaction among the independent variables (see for instance
    [1]): two-way interactions appear as the coefficients of leading terms in
    quadratic models, three-way interactions appear as the coefficients of
    leading terms in cubic models, and so forth.

On the basis of these observations, we naturally consider the least squares
approximation problem of a given square integrable function
$f\colon [0,1]^n \to \mathbb{R}$ by a polynomial of a given degree. As multiple
occurrences in combinations of variables are not relevant, we will only consider
multilinear polynomial functions. Then, given a subset
$S \subseteq \{1, \ldots, n\}$, an index $\mathcal{I}(f,S)$ measuring the
interaction among the variables $\{x_i : i \in S\}$ of $f$ is defined as the
coefficient of the monomial $\prod_{i \in S} x_i$ in the best approximation of $f$
by a multilinear polynomial of degree at most $|S|$. This definition is given and
discussed in Section 2, where we also provide an interpretation in the context of
cooperative fuzzy games (Remark 1).

In Section 3 we show that this new index has many appealing properties, such
as linearity, continuity, and symmetry. In particular, we show that, similarly to
the Banzhaf interaction index introduced for games, the index $\mathcal{I}(f,S)$
can be interpreted in a sense as an expected value of the discrete derivative of
$f$ in the direction of $S$ (Theorem 10) or, equivalently, as an expected value of
the difference quotient of $f$ in the direction of $S$ (Corollary 11). Under
certain natural conditions on $f$, the index can also be interpreted as an expected
value of the derivative of $f$ in the direction of $S$ (Proposition 7). These
latter results reveal a strong analogy between the interaction index and the
overall importance index introduced by Grabisch and Labreuche [9].

In Section 4 we discuss certain applications in aggregation function theory, in- 
cluding the computation of explicit expressions of the interaction index for the 
discrete Choquet integrals. We also define and investigate a normalized version 
of the interaction index to compare different functions in terms of interaction de- 
grees of their variables and a coefficient of determination to measure the quality of 
multilinear approximations. 

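We employ the following notation throughout the paper. Let $\mathbb{I}^n$ denote
the $n$-dimensional unit cube $[0,1]^n$. We denote by $F(\mathbb{I}^n)$ the class
of all functions $f\colon \mathbb{I}^n \to \mathbb{R}$ and by $L^2(\mathbb{I}^n)$
the class of square integrable functions $f\colon \mathbb{I}^n \to \mathbb{R}$
(modulo equality almost everywhere). For any $S \subseteq N = \{1, \ldots, n\}$,
we denote by $\mathbf{1}_S$ the characteristic vector of $S$ in $\{0,1\}^n$.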

2. Interaction indexes 

In this section we first recall the concepts of power and interaction indexes
introduced in cooperative game theory and how the Banzhaf index can be obtained
from the solution of a least squares approximation problem. Then we show how
this approximation problem can be extended to functions in $L^2(\mathbb{I}^n)$
and, from this extension, we introduce an interaction index for such functions.

Recall that a (cooperative) game on a finite set of players $N = \{1, \ldots, n\}$
is a set function $v\colon 2^N \to \mathbb{R}$ which assigns to each coalition $S$
of players a real number $v(S)$ representing the worth of $S$. Through the usual
identification of the subsets of $N$ with the elements of $\{0,1\}^n$, a game
$v\colon 2^N \to \mathbb{R}$ can be equivalently described by a pseudo-Boolean
function $f\colon \{0,1\}^n \to \mathbb{R}$. The correspondence is given by
$v(S) = f(\mathbf{1}_S)$ and

(1)    $f(x) = \sum_{S \subseteq N} v(S) \prod_{i \in S} x_i \prod_{i \in N \setminus S} (1 - x_i).$

Equation (1) shows that any pseudo-Boolean function
$f\colon \{0,1\}^n \to \mathbb{R}$ can always be represented by a multilinear
polynomial of degree at most $n$ (see [15]), which can be further simplified into

(2)    $f(x) = \sum_{S \subseteq N} a(S) \prod_{i \in S} x_i,$

where the set function $a\colon 2^N \to \mathbb{R}$, called the Möbius transform
of $v$, is defined by

    $a(S) = \sum_{T \subseteq S} (-1)^{|S|-|T|} v(T).$

Let $\mathcal{G}^N$ denote the set of games on $N$. A power index [7] on $N$ is a
function $\phi\colon \mathcal{G}^N \times N \to \mathbb{R}$ that assigns to every
player $i \in N$ in a game $f \in \mathcal{G}^N$ his/her prospect $\phi(f,i)$ from
playing the game. An interaction index [13] on $N$ is a function
$I\colon \mathcal{G}^N \times 2^N \to \mathbb{R}$ that measures in a game
$f \in \mathcal{G}^N$ the interaction degree among the players of a coalition
$S \subseteq N$.



Usually, the condition $v(\varnothing) = 0$ is required for $v$ to define a game.
However, we do not need this restriction in the present paper.
For instance, the Banzhaf interaction index [13] of a coalition $S \subseteq N$ in
a game $f \in \mathcal{G}^N$ is defined (in terms of the Möbius transform of $f$)
by

(3)    $I_B(f,S) = \sum_{T \supseteq S} \left(\frac{1}{2}\right)^{|T|-|S|} a(T),$

and the Banzhaf power index [7] of a player $i \in N$ in a game
$f \in \mathcal{G}^N$ is defined by $\phi_B(f,i) = I_B(f,\{i\})$.

It is noteworthy that $I_B(f,S)$ can be interpreted as an average of the
$S$-difference (or discrete $S$-derivative) $\Delta^S f$ of $f$. Indeed, it can be
shown (see [11, §2]) that

(4)    $I_B(f,S) = \frac{1}{2^n} \sum_{x \in \{0,1\}^n} (\Delta^S f)(x),$

where $\Delta^S f$ is defined inductively by $\Delta^{\varnothing} f = f$ and
$\Delta^S f = \Delta^{\{i\}} \Delta^{S \setminus \{i\}} f$ for $i \in S$, with
$\Delta^{\{i\}} f(x) = f(x \mid x_i = 1) - f(x \mid x_i = 0)$.
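
As a quick numerical check of formulas (3) and (4), the following Python sketch
computes the Banzhaf interaction index of a small game in both ways. The helper
names (`subsets`, `mobius`, `banzhaf_from_mobius`, `banzhaf_from_differences`) and
the example game $v(S) = |S|^2$ on three players are illustrative assumptions and
do not come from the paper.

```python
from itertools import combinations

def subsets(items):
    """Yield every subset of `items` as a frozenset."""
    items = list(items)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)

def mobius(v, players):
    """Mobius transform a(S) = sum_{T subset of S} (-1)^{|S|-|T|} v(T)."""
    return {S: sum((-1) ** (len(S) - len(T)) * v[T] for T in subsets(S))
            for S in subsets(players)}

def banzhaf_from_mobius(v, players, S):
    """Formula (3): sum over T containing S of (1/2)^{|T|-|S|} a(T)."""
    a = mobius(v, players)
    S = frozenset(S)
    return sum(0.5 ** (len(T) - len(S)) * a[T]
               for T in subsets(players) if S <= T)

def s_difference(v, S, T):
    """(Delta^S f) at the vertex encoded by coalition T.

    Coordinates in S are overwritten by every 0/1 pattern R subset of S,
    with alternating signs; coordinates outside S are kept from T.
    """
    base = frozenset(T) - frozenset(S)
    return sum((-1) ** (len(S) - len(R)) * v[base | R] for R in subsets(S))

def banzhaf_from_differences(v, players, S):
    """Formula (4): average of the S-difference over all 2^n vertices."""
    S = frozenset(S)
    total = sum(s_difference(v, S, T) for T in subsets(players))
    return total / 2 ** len(players)

# Hypothetical example game: the worth of a coalition is its squared size.
players = [1, 2, 3]
v = {S: len(S) ** 2 for S in subsets(players)}
print(banzhaf_from_mobius(v, players, {1, 2}))       # 2.0
print(banzhaf_from_differences(v, players, {1, 2}))  # 2.0
```

Both routes agree on this example, as formulas (3) and (4) predict.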

We now recall how the Banzhaf interaction index can be obtained from a least
squares approximation problem. For $k \in \{0, \ldots, n\}$, denote by $V_k$ the
set of all multilinear polynomials $g\colon \{0,1\}^n \to \mathbb{R}$ of degree at
most $k$, that is of the form

(5)    $g(x) = \sum_{S \subseteq N,\, |S| \le k} c(S) \prod_{i \in S} x_i,$

where the coefficients $c(S)$ are real numbers. For a given pseudo-Boolean function
$f\colon \{0,1\}^n \to \mathbb{R}$, the best $k$th approximation of $f$ is the
unique multilinear polynomial $f_k \in V_k$ that minimizes the squared distance

    $\sum_{x \in \{0,1\}^n} (f(x) - g(x))^2$

among all $g \in V_k$. A closed-form expression of $f_k$ was given in [14] for
$k = 1$ and $k = 2$ and in [11] for arbitrary $k \le n$. In fact, when $f$ is given
in its multilinear form (2) we obtain

    $f_k(x) = \sum_{S \subseteq N,\, |S| \le k} a_k(S) \prod_{i \in S} x_i,$

where

    $a_k(S) = a(S) + (-1)^{k-|S|} \sum_{T \supseteq S,\, |T| > k} \binom{|T|-|S|-1}{k-|S|} \left(\frac{1}{2}\right)^{|T|-|S|} a(T).$

It is then easy to see that

(6)    $I_B(f,S) = a_{|S|}(S).$

Thus, $I_B(f,S)$ is exactly the coefficient of the monomial $\prod_{i \in S} x_i$
in the best approximation of $f$ by a multilinear polynomial of degree at most
$|S|$.

Taking into account this approximation problem, we now define an interaction
index for functions in $L^2(\mathbb{I}^n)$ as follows. Denote by $W_k$ the set of
all multilinear polynomials $g\colon \mathbb{I}^n \to \mathbb{R}$ of degree at most
$k$. Clearly, these functions are also of the form (5). For a given function
$f \in L^2(\mathbb{I}^n)$, we define the best $k$th (multilinear) approximation of
$f$ as the multilinear polynomial $f_k \in W_k$ that minimizes the squared distance

(7)    $\int_{\mathbb{I}^n} (f(x) - g(x))^2 \, dx$

among all $g \in W_k$.
It is easy to see that $W_k$ is a linear subspace of $L^2(\mathbb{I}^n)$ of
dimension $\sum_{s=0}^{k} \binom{n}{s}$. Indeed, $W_k$ is the linear span of the
basis $B_k = \{v_S : S \subseteq N, |S| \le k\}$, where the functions
$v_S\colon \mathbb{I}^n \to \mathbb{R}$ are defined by
$v_S(x) = \prod_{i \in S} x_i$. Note that formula (7) also writes $\|f - g\|^2$,
where $\|\cdot\|$ is the standard norm of $L^2(\mathbb{I}^n)$ associated with the
inner product $\langle f, g \rangle = \int_{\mathbb{I}^n} f(x) g(x) \, dx$.
Therefore, using the general theory of Hilbert spaces, the solution of this
approximation problem exists and is uniquely determined by the orthogonal
projection of $f$ onto $W_k$. This projection can be easily expressed in any
orthonormal basis of $W_k$. But here it is very easy to see that the set
$B'_k = \{w_S : S \subseteq N, |S| \le k\}$, where
$w_S\colon \mathbb{I}^n \to \mathbb{R}$ is given by

    $w_S(x) = 12^{|S|/2} \prod_{i \in S} \left(x_i - \frac{1}{2}\right) = 12^{|S|/2} \sum_{T \subseteq S} \left(-\frac{1}{2}\right)^{|S|-|T|} v_T(x),$

forms such an orthonormal basis for $W_k$.

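The following immediate theorem gives the components of the best $k$th
approximation of a function $f \in L^2(\mathbb{I}^n)$ in the bases $B_k$ and
$B'_k$.

Theorem 1. For every $k \in \{0, \ldots, n\}$, the best $k$th approximation of
$f \in L^2(\mathbb{I}^n)$ is the function

(8)    $f_k = \sum_{T \subseteq N,\, |T| \le k} \langle f, w_T \rangle \, w_T = \sum_{S \subseteq N,\, |S| \le k} a_k(S) \, v_S,$

where

(9)    $a_k(S) = \sum_{T \supseteq S,\, |T| \le k} \left(-\frac{1}{2}\right)^{|T|-|S|} 12^{|T|/2} \langle f, w_T \rangle.$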

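By analogy with (6), to measure the interaction degree among variables of an
arbitrary function $f \in L^2(\mathbb{I}^n)$, we naturally define an index
$\mathcal{I}\colon L^2(\mathbb{I}^n) \times 2^N \to \mathbb{R}$ as
$\mathcal{I}(f,S) = a_{|S|}(S)$, where $a_{|S|}(S)$ is obtained from $f$ by (9).
We will see in the next section that this index indeed measures an importance
degree when $|S| = 1$ and an interaction degree when $|S| \ge 2$.

Definition 2. Let $\mathcal{I}\colon L^2(\mathbb{I}^n) \times 2^N \to \mathbb{R}$
be defined as $\mathcal{I}(f,S) = 12^{|S|/2} \langle f, w_S \rangle$, that is,

(10)    $\mathcal{I}(f,S) = 12^{|S|} \int_{\mathbb{I}^n} f(x) \prod_{i \in S} \left(x_i - \frac{1}{2}\right) dx.$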
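
Formula (10) can be approximated numerically on a grid. The sketch below is only
an illustration: the function name `interaction_index`, the midpoint rule, the
0-based variable indices, and the test function $f(x) = x_0 x_1 + 3 x_0$ are all
assumptions made here. For this multilinear $f$, the index of $S$ is the sum of
the coefficients $a(T)$, $T \supseteq S$, weighted by $(1/2)^{|T|-|S|}$, so the
values should be close to 1.75, 3.5 and 1 for $S = \varnothing$, $\{0\}$ and
$\{0,1\}$.

```python
from itertools import product
from math import prod

def interaction_index(f, n, S, m=60):
    """Approximate 12^{|S|} * int_{[0,1]^n} f(x) prod_{i in S}(x_i - 1/2) dx.

    Uses an m-point midpoint rule per coordinate; S holds 0-based indices.
    """
    nodes = [(j + 0.5) / m for j in range(m)]
    total = 0.0
    for x in product(nodes, repeat=n):
        total += f(x) * prod(x[i] - 0.5 for i in S)
    return 12 ** len(S) * total / m ** n

# Hypothetical test function on [0,1]^2.
f = lambda x: x[0] * x[1] + 3 * x[0]
for S in ([], [0], [0, 1]):
    print(S, interaction_index(f, 2, S))
```

The cost grows as $m^n$, so this direct grid approach is only practical for a
handful of variables; a Monte Carlo estimate would be the usual substitute in
higher dimension.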

Thus we have defined an interaction index from an approximation (projection)
problem. Conversely, this index characterizes this approximation problem. Indeed,
as the following result shows, the best $k$th approximation of
$f \in L^2(\mathbb{I}^n)$ is the unique function of $W_k$ that preserves the
interaction index for all the $s$-subsets such that $s \le k$. The discrete
analogue of this result was established in [11] for the Banzhaf interaction
index (3).

Proposition 3. A function $f_k \in W_k$ is the best $k$th approximation of
$f \in L^2(\mathbb{I}^n)$ if and only if $\mathcal{I}(f,S) = \mathcal{I}(f_k,S)$
for all $S \subseteq N$ such that $|S| \le k$.

Proof. By definition, we have $\mathcal{I}(f,S) = \mathcal{I}(f_k,S)$ if and only
if $\langle f - f_k, w_S \rangle = 0$ for all $S \subseteq N$ such that
$|S| \le k$, and the latter condition characterizes the projection of $f$ onto
$W_k$. □
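The explicit conversion formulas between the interaction index and the best
approximation can be easily derived from the preceding results. On the one hand,
by (9), we have

    $a_k(S) = \sum_{T \supseteq S,\, |T| \le k} \left(-\frac{1}{2}\right)^{|T|-|S|} \mathcal{I}(f,T), \quad \text{for } |S| \le k.$

On the other hand, by Proposition 3 and Equation (8), we also have

    $\mathcal{I}(f,S) = \mathcal{I}(f_k,S) = 12^{|S|/2} \langle f_k, w_S \rangle,$

that is, by calculating the inner product,

(11)    $\mathcal{I}(f,S) = \sum_{T \supseteq S,\, |T| \le k} \left(\frac{1}{2}\right)^{|T|-|S|} a_k(T), \quad \text{for } |S| \le k.$

We also note that, by (8), the best $k$th approximation of $f$ can be expressed in
terms of $\mathcal{I}$ as

(12)    $f_k(x) = \sum_{T \subseteq N,\, |T| \le k} \mathcal{I}(f,T) \prod_{i \in T} \left(x_i - \frac{1}{2}\right).$

Using the notation $\mathbf{\tfrac{1}{2}} = (\tfrac{1}{2}, \ldots, \tfrac{1}{2})$,
the Taylor expansion formula then shows that

    $\mathcal{I}(f,S) = (D^S f_k)(\mathbf{\tfrac{1}{2}}), \quad \text{for } |S| \le k,$

where $D^S$ stands for the partial derivative operator with respect to the
variables $x_i$ for $i \in S$. In particular,
$\mathcal{I}(f,\varnothing) = \int_{\mathbb{I}^n} f(x) \, dx = f_k(\mathbf{\tfrac{1}{2}})$.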
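
For a multilinear polynomial the conversion formulas can be evaluated exactly. The
following sketch uses rational arithmetic and is an illustration only: the function
names, the dictionary encoding of the coefficients $a(T)$ (keys are frozensets of
0-based indices), and the example $f(x) = x_0 x_1$ are assumptions made here. It
first applies formula (11) with $k = n$ to get the index, then the formula for
$a_k(S)$ in terms of the index to get the best approximation of degree at most $k$.

```python
from fractions import Fraction
from itertools import combinations

def subsets(items, max_size=None):
    """Yield the subsets of `items` (optionally of size <= max_size) as frozensets."""
    items = sorted(items)
    top = len(items) if max_size is None else min(max_size, len(items))
    for r in range(top + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)

def index_from_coefficients(a, n):
    """I(f, S) = sum_{T >= S} (1/2)^{|T|-|S|} a(T), i.e. formula (11) with k = n,
    for the multilinear polynomial f = sum_T a(T) prod_{i in T} x_i."""
    return {S: sum(Fraction(1, 2) ** (len(T) - len(S)) * Fraction(c)
                   for T, c in a.items() if S <= T)
            for S in subsets(range(n))}

def best_approximation(a, n, k):
    """Coefficients a_k(S), |S| <= k, of the best kth approximation:
    a_k(S) = sum_{T >= S, |T| <= k} (-1/2)^{|T|-|S|} I(f, T)."""
    index = index_from_coefficients(a, n)
    low = list(subsets(range(n), k))
    return {S: sum(Fraction(-1, 2) ** (len(T) - len(S)) * index[T]
                   for T in low if S <= T)
            for S in low}

# Hypothetical example: f(x) = x_0 * x_1 approximated by degree <= 1.
a = {frozenset({0, 1}): 1}
f1 = best_approximation(a, n=2, k=1)
for S, c in f1.items():
    print(sorted(S), c)
```

On this example the result is $f_1(x) = -\tfrac14 + \tfrac12 x_0 + \tfrac12 x_1$,
and the residual $f - f_1 = (x_0 - \tfrac12)(x_1 - \tfrac12)$ is orthogonal to
$1$, $x_0$ and $x_1$, as the projection characterization requires.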

We also have the following result, which shows that the index $\mathcal{I}$
generalizes the Banzhaf interaction index $I_B$. First note that the restriction
operation $f \mapsto f|_{\{0,1\}^n}$ defines a linear bijection between the spaces
$W_n$ and $V_n$. The inverse map is the so-called "multilinear extension".

Proposition 4. For every function $f \in W_n$ and every subset $S \subseteq N$, we
have $\mathcal{I}(f,S) = I_B(f|_{\{0,1\}^n}, S)$.

Proof. Let $f \in W_n$ be of the form
$f(x) = \sum_{T \subseteq N} a(T) \prod_{i \in T} x_i$ and let $S \subseteq N$.
Then, using (11) for $k = n$ and recalling that $a(T) = a_n(T)$ for every
$T \subseteq N$, we obtain

    $\mathcal{I}(f,S) = \sum_{T \supseteq S} \left(\frac{1}{2}\right)^{|T|-|S|} a(T).$

We then conclude by formula (3). □

Remark 1. In cooperative game theory, the set $F(\mathbb{I}^n)$ can be interpreted
as the set of fuzzy games (see for instance Aubin [2]). In this context, a fuzzy
coalition is simply an element $x \in \mathbb{I}^n$ and a fuzzy game
$f \in F(\mathbb{I}^n)$ is a mapping that associates with any fuzzy coalition its
worth. It is now clear that the index $\mathcal{I}$ is a natural extension of the
Banzhaf interaction index to fuzzy games in $L^2(\mathbb{I}^n)$ when this index
is regarded as a solution of a multilinear approximation problem.
3. Properties and interpretations

Most of the interaction indexes defined for games, including the Banzhaf
interaction index, share a set of fundamental properties such as linearity,
symmetry, and monotonicity (see [8, 13]). Many of them can also be expressed as
expected values of the discrete derivatives (differences) of their arguments (see
for instance (4)). In this section we show that the index $\mathcal{I}$ fulfills
direct generalizations of these properties to the framework of functions of
$L^2(\mathbb{I}^n)$. In particular, we show that $\mathcal{I}(f,S)$ can be
interpreted as an expected value of the difference quotient of $f$ in the
direction of $S$ or, under certain natural conditions on $f$, as an expected value
of the derivative of $f$ in the direction of $S$.

The first result follows from the very definition of the index. 

Proposition 5. For every $S \subseteq N$, the mapping
$f \mapsto \mathcal{I}(f,S)$ is linear and continuous.

Recall that if $\pi$ is a permutation on $N$, then, for every function
$f \in F(\mathbb{I}^n)$, the permutation $\pi$ acts on $f$ by
$\pi(f)(x_1, \ldots, x_n) = f(x_{\pi(1)}, \ldots, x_{\pi(n)})$. The following
result is then an easy consequence of the change of variables theorem.

Proposition 6. The index $\mathcal{I}$ is symmetric. That is, for every
permutation $\pi$ on $N$, every $f \in L^2(\mathbb{I}^n)$, and every
$S \subseteq N$, we have $\mathcal{I}(\pi(f), \pi(S)) = \mathcal{I}(f,S)$.

We now provide an interpretation of $\mathcal{I}(f,S)$ as an expected value of the
$S$-derivative $D^S f$ of $f$. The proof immediately follows from repeated
integrations by parts of (10) and thus is omitted.

For $S \subseteq N$, denote by $h_S$ the probability density function of
independent beta distributions on $\mathbb{I}^n$ with parameters
$\alpha = \beta = 2$, that is, $h_S(x) = 6^{|S|} \prod_{i \in S} x_i (1 - x_i)$.

Proposition 7. For every $S \subseteq N$ and every $f \in L^2(\mathbb{I}^n)$ such
that $D^T f$ is continuous and integrable on $]0,1[^n$ for all $T \subseteq S$, we
have

(13)    $\mathcal{I}(f,S) = \int_{\mathbb{I}^n} h_S(x) \, D^S f(x) \, dx.$
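
The agreement between definition (10) and the derivative form (13) can be checked
numerically on a smooth example. The sketch below is an illustration under stated
assumptions: the helper names, the midpoint rule, the 0-based indices, and the
test function $f(x) = x_0^2 x_1$ with $S = \{0\}$ (so that $D^S f(x) = 2 x_0 x_1$,
supplied by hand) are choices made here. Both sides equal $1/2$ exactly for this
function.

```python
from itertools import product
from math import prod

def integrate(g, n, m=80):
    """Midpoint-rule approximation of the integral of g over [0,1]^n."""
    nodes = [(j + 0.5) / m for j in range(m)]
    return sum(g(x) for x in product(nodes, repeat=n)) / m ** n

def index_by_definition(f, n, S):
    """Formula (10): 12^{|S|} * int f(x) prod_{i in S}(x_i - 1/2) dx."""
    return 12 ** len(S) * integrate(
        lambda x: f(x) * prod(x[i] - 0.5 for i in S), n)

def index_by_derivative(dS_f, n, S):
    """Formula (13): expectation of D^S f under the beta(2,2) density h_S."""
    return integrate(
        lambda x: 6 ** len(S) * prod(x[i] * (1 - x[i]) for i in S) * dS_f(x), n)

# Hypothetical smooth test function and its hand-computed partial derivative.
f = lambda x: x[0] ** 2 * x[1]
d0_f = lambda x: 2 * x[0] * x[1]

lhs = index_by_definition(f, 2, [0])
rhs = index_by_derivative(d0_f, 2, [0])
print(lhs, rhs)
```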

Remark 2. (a) Formulas (4) and (13) show a strong analogy between the indexes
      $I_B$ and $\mathcal{I}$. Indeed, $I_B(f,S)$ is the expected value of the
      $S$-difference of $f$ with respect to the discrete uniform distribution
      whereas $\mathcal{I}(f,S)$ is the expected value of the $S$-derivative of
      $f$ with respect to a beta distribution. We will see in Theorem 10 a
      similar interpretation of $\mathcal{I}(f,S)$ which does not require all the
      assumptions of Proposition 7.
  (b) Propositions 3 and 7 reveal an analogy between least squares
      approximations and the Taylor expansion formula. Indeed, while the
      $k$-degree Taylor expansion of $f$ at a given point $a$ can be seen as the
      unique polynomial of degree at most $k$ whose derivatives at $a$ coincide
      with the derivatives of $f$ at the same point, the best $k$th approximation
      of $f$ is the unique multilinear polynomial of degree at most $k$ that
      agrees with $f$ in all average $S$-derivatives for $|S| \le k$.

We now give an alternative interpretation of $\mathcal{I}(f,S)$ as an expected
value, which does not require the additional assumptions of Proposition 7. In this
more general framework, we naturally replace the derivative with a difference
quotient. To this end, we introduce some further notation. As usual, we denote by
$e_i$ the $i$th vector of the standard basis of $\mathbb{R}^n$. For every
$S \subseteq N$ and every $h \in \mathbb{I}^n$, we define the $S$-shift operator
$E^S_h$ on $F(\mathbb{I}^n)$ by

    $E^S_h f(x) = f\Big(x + \sum_{j \in S} h_j e_j\Big)$

for every $x \in \mathbb{I}^n$ such that $x + h \in \mathbb{I}^n$.

We also define the $S$-difference (or discrete $S$-derivative) operator
$\Delta^S_h$ on $F(\mathbb{I}^n)$ inductively by $\Delta^{\varnothing}_h f = f$
and $\Delta^S_h f = \Delta^{\{i\}}_h \Delta^{S \setminus \{i\}}_h f$ for $i \in S$,
with $\Delta^{\{i\}}_h f(x) = E^{\{i\}}_h f(x) - f(x)$. Similarly, we define the
$S$-difference quotient operator $Q^S_h$ on $F(\mathbb{I}^n)$ by
$Q^{\varnothing}_h f = f$ and $Q^S_h f = Q^{\{i\}}_h Q^{S \setminus \{i\}}_h f$ for
$i \in S$, with $Q^{\{i\}}_h f(x) = \frac{1}{h_i} \Delta^{\{i\}}_h f(x)$.

The next straightforward lemma provides a direct link between the difference
operators and the shift operators. It actually shows that, for every fixed
$h \in \mathbb{I}^n$, the map $S \mapsto \Delta^S_h$ is nothing other than the
Möbius transform of the map $S \mapsto E^S_h$.

Lemma 8. For every $f \in F(\mathbb{I}^n)$ and every $S \subseteq N$, we have

(14)    $\Delta^S_h f(x) = \sum_{T \subseteq S} (-1)^{|S|-|T|} E^T_h f(x).$

Let us interpret the $S$-difference operator through a simple example. For $n = 3$
and $S = \{1,2\}$, we have

    $\Delta^S_h f(x) = f(x_1 + h_1, x_2 + h_2, x_3) - f(x_1 + h_1, x_2, x_3) - f(x_1, x_2 + h_2, x_3) + f(x_1, x_2, x_3).$

In complete analogy with the discrete concept of marginal interaction among players
in a coalition $S \subseteq N$ (see [11, §2]), the value $\Delta^S_h f(x)$ can be
interpreted as the marginal interaction among variables $x_i$ ($i \in S$) at $x$
with respect to the increases $h_i$ for $i \in S$.

Setting $h = y - x$ in the example above, we obtain

    $\Delta^S_{y-x} f(x) = f(y_1, y_2, x_3) - f(y_1, x_2, x_3) - f(x_1, y_2, x_3) + f(x_1, x_2, x_3).$

If $x_i \le y_i$ for every $i \in S$, then $\Delta^S_{y-x} f(x)$ is naturally called
the $f$-volume of the box $\prod_{i \in S} [x_i, y_i]$. The following
straightforward lemma shows that, when $f = v_S$, $\Delta^S_{y-x} f(x)$ is exactly
the volume of the box $\prod_{i \in S} [x_i, y_i]$.

Lemma 9. For every $S \subseteq N$, we have
$\Delta^S_{y-x} v_S(x) = \prod_{i \in S} (y_i - x_i)$.

In the remaining part of this paper, the notation $y_S \in [x_S, 1]$ means that
$y_i \in [x_i, 1]$ for every $i \in S$.

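Theorem 10. For every $f \in L^2(\mathbb{I}^n)$ and every $S \subseteq N$, we have

(15)    $\mathcal{I}(f,S) = \frac{1}{\mu(S)} \int_{x \in \mathbb{I}^n} \int_{y_S \in [x_S, 1]} \Delta^S_{y-x} f(x) \, dy_S \, dx,$

where

    $\mu(S) = \int_{x \in \mathbb{I}^n} \int_{y_S \in [x_S, 1]} \Delta^S_{y-x} v_S(x) \, dy_S \, dx = 6^{-|S|}.$

Proof. Since the result is trivial if $S = \varnothing$, we can assume that
$S \ne \varnothing$. We first observe that the value of $\mu(S)$ immediately
follows from Lemma 9. Then, for any $T \subseteq N$ and any $i \in T$, we have

(16)    $\int_0^1 \int_{x_i}^1 E^T_{y-x} f(x) \, dy_i \, dx_i = \int_0^1 y_i \, E^T_{y-x} f(x) \, dy_i = \int_0^1 x_i \, E^{T \setminus \{i\}}_{y-x} f(x) \, dx_i,$

where the first equality is obtained by permuting the integrals and the second
equality by replacing the integration variable $y_i$ with $x_i$. Moreover, we have
immediately

(17)    $\int_0^1 \int_{x_i}^1 f(x) \, dy_i \, dx_i = \int_0^1 (1 - x_i) f(x) \, dx_i.$

Using (14) and repeated applications of (16) and (17), we finally obtain

    $\int_{x \in \mathbb{I}^n} \int_{y_S \in [x_S, 1]} \Delta^S_{y-x} f(x) \, dy_S \, dx$
    $\quad = \sum_{T \subseteq S} (-1)^{|S|-|T|} \int_{\mathbb{I}^n} \prod_{i \in T} x_i \prod_{i \in S \setminus T} (1 - x_i) \, f(x) \, dx$
    $\quad = \int_{\mathbb{I}^n} \prod_{i \in S} (2 x_i - 1) \, f(x) \, dx = 6^{-|S|} \, \mathcal{I}(f,S),$

which completes the proof. □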
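
Formula (15) can also be checked numerically. The sketch below is an illustration
under stated assumptions: the helper names, the midpoint rule, the substitution
$y_i = x_i + t_i (1 - x_i)$ with $t_i \in [0,1]$ (Jacobian $\prod_{i \in S}(1 - x_i)$)
used to map the region $y_S \in [x_S, 1]$ onto a cube, and the test function
$f(x) = x_0^2 x_1$ are choices made here. For this $f$ and $S = \{0, 1\}$, formula
(10) gives $\mathcal{I}(f,S) = 1$, so the printed value should be close to 1.

```python
from itertools import combinations, product
from math import prod

def s_difference(f, x, y, S):
    """Delta^S_{y-x} f(x) via Lemma 8: alternating sum over T subset of S of
    f evaluated at x with the coordinates in T replaced by those of y."""
    total = 0.0
    for r in range(len(S) + 1):
        for T in combinations(S, r):
            z = list(x)
            for i in T:
                z[i] = y[i]
            total += (-1) ** (len(S) - r) * f(z)
    return total

def index_by_differences(f, n, S, m=16):
    """Right-hand side of (15) with mu(S) = 6^{-|S|}, by a midpoint rule."""
    nodes = [(j + 0.5) / m for j in range(m)]
    total = 0.0
    for x in product(nodes, repeat=n):
        jacobian = prod(1 - x[i] for i in S)
        for t in product(nodes, repeat=len(S)):
            y = list(x)
            for i, ti in zip(S, t):
                y[i] = x[i] + ti * (1 - x[i])  # y_i ranges over [x_i, 1]
            total += jacobian * s_difference(f, x, y, S)
    integral = total / m ** (n + len(S))
    return 6 ** len(S) * integral

# Hypothetical test function on [0,1]^2.
f = lambda x: x[0] ** 2 * x[1]
value = index_by_differences(f, 2, [0, 1])
print(value)
```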

Remark 3. (a) By Lemma 9, we see that $\mathcal{I}(f,S)$ can be interpreted as the
      average $f$-volume of the box $\prod_{i \in S} [x_i, y_i]$ divided by its
      average volume, when $x$ and $y_S$ are chosen at random with the uniform
      distribution.

  (b) As already mentioned in Remark 2(a), Theorem 10 appears as a natural
      generalization of formula (4) (similarly to Proposition 7) in the sense that
      the marginal interaction $\Delta^S_h f(x)$ at $x$ is averaged over the whole
      domain $\mathbb{I}^n$ (instead of its vertices).

  (c) We note a strong analogy between formula (15) and the overall importance
      index defined by Grabisch and Labreuche in [9, Theorem 1]. Indeed, up to
      the normalization constant, this importance index is obtained by replacing
      in formula (15) the operator $\Delta^S_{y-x}$ by $E^S_{y-x} - I$. Moreover,
      when $S$ is a singleton, both operators coincide and so do the
      normalization constants.

As an immediate consequence of Theorem 10, we have the following interpretation
of the index $\mathcal{I}$ as an expected value of the difference quotients of its
argument with respect to some probability distribution.

Corollary 11. For every $f \in L^2(\mathbb{I}^n)$ and every $S \subseteq N$, we
have

    $\mathcal{I}(f,S) = \int_{x \in \mathbb{I}^n} \int_{y_S \in [x_S, 1]} p_S(x, y_S) \, Q^S_{y-x} f(x) \, dy_S \, dx,$

where the function $p_S(x, y_S) = 6^{|S|} \prod_{i \in S} (y_i - x_i)$ defines a
probability density function on the set
$\{(x, y_S) : x \in \mathbb{I}^n, y_S \in [x_S, 1]\}$.

Let us now analyze the behavior of the interaction index $\mathcal{I}$ on some
special classes of functions. The following properties generalize in a very
natural way to our setting the behavior of the Banzhaf interaction index $I_B$
with respect to the presence of null players and dummy coalitions.

Recall that a null player in a game (or a set function) $v \in \mathcal{G}^N$ is a
player $i \in N$ such that $v(T \cup \{i\}) = v(T)$ for every
$T \subseteq N \setminus \{i\}$. Equivalently, the corresponding pseudo-Boolean
function $f\colon \{0,1\}^n \to \mathbb{R}$, given by (1), is independent of $x_i$.
The notion of null player for games is then naturally extended through the notion
of ineffective variables for functions in $F(\mathbb{I}^n)$ as follows. A variable
$x_i$ ($i \in N$) is said to be ineffective for a function $f$ in
$F(\mathbb{I}^n)$ if $f(x) = E^{\{i\}}_{-x} f(x)$ for every $x \in \mathbb{I}^n$,
or equivalently, if $\Delta^{\{i\}}_{y-x} f(x) = 0$ for every
$x, y \in \mathbb{I}^n$.

Define $I_f = \{i \in N : x_i \text{ ineffective for } f\}$. From either (10) or
(15), we immediately derive the following result, which states that any
combination of variables containing at least one ineffective variable for a
function $f \in L^2(\mathbb{I}^n)$ has necessarily a zero interaction.

Proposition 12. For every $f \in L^2(\mathbb{I}^n)$ and every $S \subseteq N$ such
that $S \cap I_f \ne \varnothing$, we have $\mathcal{I}(f,S) = 0$.

We say that a coalition $S \subseteq N$ is dummy in a game (or a set function)
$v \in \mathcal{G}^N$ if $v(R \cup T) = v(R) + v(T) - v(\varnothing)$ for every
$R \subseteq S$ and every $T \subseteq N \setminus S$. This means that
$\{S, N \setminus S\}$ forms a partition of $N$ such that, for every coalition
$K \subseteq N$, the relative worth $v(K) - v(\varnothing)$ is the sum of the
relative worths of its intersections with $S$ and $N \setminus S$. It follows that
a coalition $S$ and its complement $N \setminus S$ are simultaneously dummy in any
game $v \in \mathcal{G}^N$.

We propose the following extension of this concept.

Definition 13. We say that a subset $S \subseteq N$ is dummy for a function
$f \in F(\mathbb{I}^n)$ if
$f(x) = E^{S}_{-x} f(x) + E^{N \setminus S}_{-x} f(x) - f(\mathbf{0})$ for every
$x \in \mathbb{I}^n$.

The following proposition gives an immediate interpretation of this definition. 

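Proposition 14. A subset $S \subseteq N$ is dummy for a function
$f \in F(\mathbb{I}^n)$ if and only if there exist functions
$f_S, f_{N \setminus S} \in F(\mathbb{I}^n)$ such that
$I_{f_S} \supseteq N \setminus S$, $I_{f_{N \setminus S}} \supseteq S$ and
$f = f_S + f_{N \setminus S}$.

Proof. For the necessity, just set
$f_S(x) = E^{N \setminus S}_{-x} f(x) - f(\mathbf{0})$ and
$f_{N \setminus S} = f - f_S$. The sufficiency can be checked directly. □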

The following result expresses the natural idea that the interaction for subsets
that are properly partitioned by a dummy subset must be zero. It is an immediate
consequence of Propositions 5, 12 and 14.

Proposition 15. For every $f \in L^2(\mathbb{I}^n)$, every nonempty subset
$S \subseteq N$ that is dummy for $f$, and every subset $K \subseteq N$ such that
$K \cap S \ne \varnothing$ and $K \setminus S \ne \varnothing$, we have
$\mathcal{I}(f,K) = 0$.

Another immediate consequence of Proposition 15 is that additive functions have
zero interaction indexes for $s$-subsets with $s \ge 2$. This fact can be
straightforwardly extended to the class of $k$-additive functions as follows.

Definition 16. A function $f \in L^2(\mathbb{I}^n)$ is said to be $k$-additive for
some $k \in \{1, \ldots, n\}$ if there exists a family of functions
$\{f_R \in L^2(\mathbb{I}^n) : R \subseteq N, |R| \le k\}$ satisfying
$I_{f_R} \supseteq N \setminus R$ such that $f = \sum_R f_R$.

Corollary 17. Let $f = \sum_R f_R \in L^2(\mathbb{I}^n)$ be a $k$-additive function
and let $S \subseteq N$. We have $\mathcal{I}(f,S) = 0$ if $|S| > k$ and
$\mathcal{I}(f,S) = \mathcal{I}(f_S,S)$ if $|S| = k$.

Let us now introduce the concept of $S$-increasing monotonicity by refining the
classical concept of $n$-increasing monotonicity for functions of $n$ variables
(see for instance [20, p. 43]).

Definition 18. Let $S \subseteq N$. We say that a function
$f \in F(\mathbb{I}^n)$ is $S$-increasing if $\Delta^S_{y-x} f(x) \ge 0$ for all
$x, y \in \mathbb{I}^n$ such that $x \le y$.



MEASURING THE INTERACTIONS AMONG VARIABLES 






The following result then follows immediately from Theorem 10.

Proposition 19. If $f \in L^2(\mathbb{I}^n)$ is $S$-increasing for some
$S \subseteq N$, then $\mathcal{I}(f,S) \ge 0$.

We end this section by analyzing the behavior of the index $\mathcal{I}$ with
respect to dualization, which is a standard concept for instance in aggregation
function theory (see [10, p. 48]). The dual of a function $f \in F(\mathbb{I}^n)$
is the function $f^d \in F(\mathbb{I}^n)$ defined by
$f^d(x) = 1 - f(\mathbf{1}_N - x)$. A function $f \in F(\mathbb{I}^n)$ is said to
be self-dual if $f^d = f$. By using the change of variables theorem, we immediately
derive the following result.

Proposition 20. For every $f \in L^2(\mathbb{I}^n)$ and every nonempty
$S \subseteq N$, we have $\mathcal{I}(f^d,S) = (-1)^{|S|+1} \mathcal{I}(f,S)$.
Moreover, $\mathcal{I}(f^d,\varnothing) = 1 - \mathcal{I}(f,\varnothing)$. In
particular, if $f$ is self-dual, then $\mathcal{I}(f,\varnothing) = 1/2$ and
$\mathcal{I}(f,S) = 0$ whenever $|S|$ is even.

Remark 4. Given $f \in L^2(\mathbb{I}^n)$, we define the self-dual and
anti-self-dual parts of $f$ by $f^s = (f + f^d)/2$ and $f^a = (f - f^d)/2$,
respectively. It follows from Proposition 20 that, for every nonempty
$S \subseteq N$, we have $\mathcal{I}(f,S) = \mathcal{I}(f^a,S)$ if $|S|$ is even,
and $\mathcal{I}(f,S) = \mathcal{I}(f^s,S)$ if $|S|$ is odd.

4. Applications to aggregation function theory 

When we need to summarize, fuse, or merge a set of values into a single one, 
we usually make use of a so-called aggregation function, e.g., a mean or an aver- 
aging function. Various aggregation functions have been proposed thus far in the 
literature, thus giving rise to the growing theory of aggregation which proposes, 
analyzes, and characterizes aggregation function classes. For recent references, see
Beliakov et al. [4] and Grabisch et al. [10].

In this context it is often useful to analyze the general behavior of a given
aggregation function $f$ with respect to its variables. The index $\mathcal{I}$
then offers a good solution to the problems of (i) determining which variables
have the greatest influence over $f$ and (ii) measuring how the variables interact
within $f$.

In this section we first compute explicit expressions of the interaction index for 
the discrete Choquet integral, a noteworthy aggregation function which has been 
widely investigated due to its many applications for instance in decision making (see
for instance [12]). Then we proceed similarly for the class of pseudo-multilinear
polynomials, which includes the multiplicative functions and, in particular, the 
weighted geometric means. Finally, we introduce a normalized version of the index 
to compare interactions from different functions and compute the coefficient of 
determination of the multilinear approximations. 

4.1. Discrete Choquet integrals. A discrete Choquet integral is a function
$f \in F(\mathbb{I}^n)$ of the form

(18)    $f(x) = \sum_{T \subseteq N} a(T) \min_{i \in T} x_i,$

where the set function $a\colon 2^N \to \mathbb{R}$ is nondecreasing with respect
to set inclusion and such that $a(\varnothing) = 0$ and
$\sum_{S \subseteq N} a(S) = 1$. For general background, see for instance
[10, Section 5.4].



Whether the conditions on the set function $a$ are assumed or not, the function
given in (18) is also called the Lovász extension of the pseudo-Boolean function
$f|_{\{0,1\}^n}$.
The following proposition yields an explicit expression of the interaction index
for the class of discrete Choquet integrals. We first consider a lemma and recall
that the beta function is defined, for any integers $p, q > 0$, by

    $B(p,q) = \int_0^1 t^{p-1} (1-t)^{q-1} \, dt = \frac{(p-1)! \, (q-1)!}{(p+q-1)!}.$

Lemma 21. We have

    $\int_{[0,1]^n} \min_{i \in T} x_i \prod_{i \in S} \left(x_i - \frac{1}{2}\right) dx = \begin{cases} 2^{-|S|} B(|S|+1, |T|+1), & \text{if } S \subseteq T, \\ 0, & \text{otherwise.} \end{cases}$

Proof. The result is trivial if $S \nsubseteq T$. Thus, we assume that
$S \subseteq T$. Assume also without loss of generality that $T \ne \varnothing$.

For distinct real numbers $x_1, \ldots, x_m$ we have

    $\min_{i} x_i = \sum_{j} x_j \prod_{i \ne j} H(x_i - x_j),$

where $H\colon \mathbb{R} \to \mathbb{R}$ is the Heaviside step function
($H(x) = 1$ if $x \ge 0$ and $0$ otherwise).
Therefore, we have 

We then conclude by calculating this latter integral by parts. □ 
Proposition 22. If $f \in F(\mathbb{I}^n)$ is of the form (18), then we have

    $\mathcal{I}(f,S) = 6^{|S|} \sum_{T \supseteq S} a(T) \, B(|S|+1, |T|+1).$
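
The closed form of Proposition 22 is easy to evaluate and to compare with a direct
numerical integration of (10). The sketch below is an illustration only: the
function names, the dictionary encoding of $a$ (keys are frozensets of 0-based
indices), the midpoint rule, and the two-variable example with $a(\{0\}) = 0.3$,
$a(\{1\}) = 0.2$, $a(\{0,1\}) = 0.5$ are assumptions made here.

```python
from itertools import product
from math import factorial, prod

def beta(p, q):
    """Beta function B(p, q) for positive integers p and q."""
    return factorial(p - 1) * factorial(q - 1) / factorial(p + q - 1)

def choquet_index(a, S):
    """Proposition 22: 6^{|S|} * sum_{T >= S} a(T) * B(|S|+1, |T|+1)."""
    S = frozenset(S)
    return 6 ** len(S) * sum(c * beta(len(S) + 1, len(T) + 1)
                             for T, c in a.items() if S <= T)

def choquet(a, x):
    """f(x) = sum_T a(T) * min_{i in T} x_i (the empty set is skipped)."""
    return sum(c * min(x[i] for i in T) for T, c in a.items() if T)

def numeric_index(f, n, S, m=60):
    """Midpoint-rule approximation of formula (10)."""
    nodes = [(j + 0.5) / m for j in range(m)]
    total = sum(f(x) * prod(x[i] - 0.5 for i in S)
                for x in product(nodes, repeat=n))
    return 12 ** len(S) * total / m ** n

# Hypothetical coefficients on two variables.
a = {frozenset({0}): 0.3, frozenset({1}): 0.2, frozenset({0, 1}): 0.5}
closed = choquet_index(a, {0, 1})
approx = numeric_index(lambda x: choquet(a, x), 2, [0, 1])
print(closed, approx)
```

Here the closed form gives $36 \cdot 0.5 \cdot B(3,3) = 0.6$ for $S = \{0,1\}$ and
$6\,(0.3\,B(2,2) + 0.5\,B(2,3)) = 0.55$ for $S = \{0\}$.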

Remark 5. The map
$a \mapsto \mathcal{I}(f,S) = 6^{|S|} \sum_{T \supseteq S} a(T) \, B(|S|+1, |T|+1)$
defines an interaction index, in the sense of [8], that is not a probabilistic
index (see [8, Section 3.3]). However, if we normalize this interaction index (with
respect to $|S|$) to get a probabilistic index, we actually divide
$\mathcal{I}(f,S)$ by $6^{|S|} B(|S|+1, |S|+1)$ and retrieve the index $I_M$
defined in [19].

4.2. Pseudo-multilinear polynomials. We now derive an explicit expression of
the index $\mathcal{I}$ for the class of pseudo-multilinear polynomials, that is,
the class of multilinear polynomials with transformed variables.

Definition 23. We say that a function $f \in L^2(\mathbb{I}^n)$ is a
pseudo-multilinear polynomial if there exists a multilinear polynomial
$g \in F(\mathbb{R}^n)$ and $n$ unary functions
$\varphi_1, \ldots, \varphi_n \in L^2(\mathbb{I})$ such that
$f(x) = g(\varphi_1(x_1), \ldots, \varphi_n(x_n))$ for every
$x = (x_1, \ldots, x_n) \in \mathbb{I}^n$.
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Using expression ([5]) of multilinear polynomials, we immediately see that any 
pseudo- multilinear polynomial / e (I" ) can be written in the form 

/(X)= ^ a{T)Ylv^{x^). 

The following result yields an explicit expression of the interaction index for this 
function in terms of the interaction indexes for the unary functions ipi, . . . ,ipn. 

Proposition 24. For every pseudo-multilinear polynomial f e and every 

S c N , we have 

Proof. By linearity of I, we only have to compute Xdli^T V'ii It is zero if 
S $ T hy Proposition [T2l li S £ T, we simply use (fTO|) and compute the integrals 
separately. □ 

Remark 6. Proposition [24] can actually be easily extended to functions of the form 

/(x)= ^ aiT)Yl'pI{^^), 

where ipf e L'^{1) for i = 1, . . . ,n and T £ N. 

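An interesting subclass of pseudo-multilinear polynomials is the class of multiplicative
functions, that is, functions of the form $f(\mathbf{x}) = \prod_{i=1}^n \varphi_i(x_i)$, where
$\varphi_1,\ldots,\varphi_n \in L^2(I)$. For every multiplicative function $f \in L^2(I^n)$ and every $S \subseteq N$,
assuming $\mathcal{I}(f,\varnothing) \neq 0$, the ratio $\mathcal{I}(f,S)/\mathcal{I}(f,\varnothing)$ is also multiplicative in the sense
that

(19) \[ \frac{\mathcal{I}(f,S)}{\mathcal{I}(f,\varnothing)} = \prod_{i \in S} \frac{\mathcal{I}(\varphi_i,\{i\})}{\mathcal{I}(\varphi_i,\varnothing)}. \]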

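Combining this with (12) and the identity $\sum_{T \subseteq N} \prod_{i \in T} z_i = \prod_{i \in N} (1 + z_i)$, we can
write the best $n$th approximation of $f$ as

\[ f_n(\mathbf{x}) = \mathcal{I}(f,\varnothing) \prod_{i \in N} \Big( 1 + \frac{\mathcal{I}(\varphi_i,\{i\})}{\mathcal{I}(\varphi_i,\varnothing)} \big( x_i - \tfrac12 \big) \Big). \]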

4.3. Normalized index and coefficients of determination. Just as for interaction
indexes introduced in game theory [13] and the importance index defined by
Grabisch and Labreuche [9], the interaction index $\mathcal{I}$ is a linear map. This implies
that it cannot be considered as an absolute interaction index but rather as a relative
index constructed to assess and compare interactions for a given function.

If we want to compare interactions for different functions, we need to consider
an absolute (normalized) interaction index. Such an index is actually easy to define
if we use the following probabilistic viewpoint: considering the unit cube $I^n$ as a
probability space with respect to the Lebesgue measure, we see that, for a nonempty
subset $S \subseteq N$, the index $\mathcal{I}(f,S)$ is actually the covariance of the random variables $f$
and $12^{|S|/2} w_S$, where $w_S(\mathbf{x}) = 12^{|S|/2} \prod_{i \in S} (x_i - \frac12)$ has zero mean and unit variance.
It is then natural to consider the Pearson (or correlation) coefficient
instead of the covariance.

Definition 25. The normalized interaction index is the mapping

\[ r \colon \{ f \in L^2(I^n) : \sigma(f) \neq 0 \} \times (2^N \setminus \{\varnothing\}) \to \mathbb{R} \]

defined by

\[ r(f,S) = \frac{\mathcal{I}(f,S)}{12^{|S|/2}\, \sigma(f)} = \Big\langle \frac{f - E(f)}{\sigma(f)},\, w_S \Big\rangle, \]

where $E(f)$ and $\sigma(f)$ are the expectation and the standard deviation of $f$, respectively,
when $f$ is regarded as a random variable.

From this definition it follows that $-1 \leq r(f,S) \leq 1$. Moreover, this index remains
unchanged under interval scale transformations, that is, $r(af + b, S) = r(f,S)$ for
all $a > 0$ and $b \in \mathbb{R}$. Note also that the normalized indexes for a function $f \in L^2(I^n)$
and its dual $f^d$ are linked by $r(f^d,S) = (-1)^{|S|+1}\, r(f,S)$, where $S \neq \varnothing$.

Let us examine on a few examples the behavior of the normalized importance
index $r(f,\{i\})$ of a variable $x_i$.

• For the arithmetic mean $f(\mathbf{x}) = \frac1n \sum_{i=1}^n x_i$, we have $\sigma(f) = (12n)^{-1/2}$, $\mathcal{I}(f,\{i\}) = 1/n$
for all $i \in N$, and hence $r(f,\{i\}) = 1/\sqrt{n}$.

• For the minimum function $f(\mathbf{x}) = \min_{i \in N} x_i$, we have

\[ \sigma^2(f) = \frac{n}{(n+1)^2 (n+2)} \]

(see [19, Lemma 6]). By Proposition 22 we then have

\[ r(f,\{i\}) = \frac{\sqrt{3}}{\sqrt{n(n+2)}} \]

for every $i \in N$. By duality, the same result holds for the maximum function
$f^d(\mathbf{x}) = \max_{i \in N} x_i$. These values quantify the intuitive fact that the
overall importance of a given variable is greater in the arithmetic mean
than in the minimum and the maximum functions.

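• Consider the weighted geometric mean $f(\mathbf{x}) = \prod_{i=1}^n x_i^{c_i}$, where $c_1,\ldots,c_n \geq 0$
and $\sum_{i=1}^n c_i = 1$. Using (19), for every nonempty subset $S \subseteq N$, we have

\[ r(f,S) = \Big( \prod_{i \in N} \frac{(c_i+1)^2}{2c_i+1} - 1 \Big)^{-1/2} \prod_{i \in S} \frac{\sqrt{3}\, c_i}{c_i+2}. \]

In the special case of the symmetric geometric mean function, we have

\[ r(f,S) = \Big( \Big( \frac{(n+1)^2}{n(n+2)} \Big)^{n} - 1 \Big)^{-1/2} \Big( \frac{\sqrt{3}}{2n+1} \Big)^{|S|}. \]

Here again, we can show that the importance of variables in the arithmetic
mean is greater than the importance of variables in the geometric mean
function.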
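
The comparisons made in these examples can be reproduced numerically. The sketch below assumes that $r(f,S) = \mathcal{I}(f,S) / (12^{|S|/2} \sigma(f))$ with $\mathcal{I}(f,S) = 12^{|S|} \int_{I^n} f(\mathbf{x}) \prod_{i \in S} (x_i - \frac12)\, d\mathbf{x}$, and estimates all integrals with a midpoint rule on a grid. The helper name `normalized_index`, the grid size, and the choice $n = 3$ are illustrative.

```python
from itertools import product
from math import prod, sqrt


def normalized_index(f, n, S, m=40):
    """Estimate r(f, S) on an m^n midpoint grid of the unit cube."""
    pts = [(j + 0.5) / m for j in range(m)]
    s1 = s2 = sw = 0.0
    for x in product(pts, repeat=n):
        y = f(x)
        s1 += y                                   # for E(f)
        s2 += y * y                               # for E(f^2)
        sw += y * prod(x[i] - 0.5 for i in S)     # for the index integral
    count = m ** n
    mean = s1 / count
    sigma = sqrt(s2 / count - mean ** 2)
    index = 12 ** len(S) * sw / count
    return index / (12 ** (len(S) / 2) * sigma)


n = 3
r_mean = normalized_index(lambda x: sum(x) / n, n, (0,))        # arithmetic mean
r_min = normalized_index(min, n, (0,))                          # minimum
r_geo = normalized_index(lambda x: prod(x) ** (1 / n), n, (0,))  # symmetric geometric mean

print("arithmetic mean:", r_mean, "closed form:", 1 / sqrt(n))
print("minimum:        ", r_min, "closed form:", sqrt(3) / sqrt(n * (n + 2)))
print("geometric mean: ", r_geo)
```

For $n = 3$ the estimate for the arithmetic mean should exceed both of the others, in line with the discussion above.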

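The normalized index is also useful to compute the coefficient of determination
of the best $k$th approximation of $f$, which measures the goodness of fit of the
multilinear model. Assuming that $\sigma(f) \neq 0$, this coefficient is given by

\[ R_k^2(f) = \frac{\sigma^2(f_k)}{\sigma^2(f)}. \]

Since $E(f_k) = \mathcal{I}(f_k,\varnothing) = \mathcal{I}(f,\varnothing) = E(f)$ (see Proposition [S]), by (11), we obtain

\[ \sigma^2(f_k) = \| f_k - E(f_k) \|^2 = \sum_{\substack{T \subseteq N \\ 1 \leq |T| \leq k}} \frac{\mathcal{I}(f,T)^2}{12^{|T|}} = \sigma^2(f) \sum_{\substack{T \subseteq N \\ 1 \leq |T| \leq k}} r(f,T)^2 \]

and hence

(20) \[ R_k^2(f) = \sum_{\substack{T \subseteq N \\ 1 \leq |T| \leq k}} r(f,T)^2. \]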

Remark 7. The coefficient of determination explains why the normalized importance
of each variable in the arithmetic mean is greater than that in the minimum
function, the maximum function, and the geometric mean function. Indeed, if $f$
is a symmetric function, then $r(f,\{i\}) = r(f,\{j\})$ for every $i,j \in N$ and, since
$R_1^2(f) \leq 1$, by (20) we have $n\, r(f,\{i\})^2 \leq 1$, that is, $r(f,\{i\}) \leq 1/\sqrt{n}$, the value attained by the arithmetic mean.
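
The coefficients of determination in (20) can be estimated by summing squared normalized indexes over the nonempty subsets of size at most $k$. The sketch below assumes $r(f,T)^2 = \mathcal{I}(f,T)^2 / (12^{|T|} \sigma^2(f))$ with $\mathcal{I}(f,T) = 12^{|T|} \int_{I^n} f(\mathbf{x}) \prod_{i \in T} (x_i - \frac12)\, d\mathbf{x}$, and uses a midpoint grid. The helper name `determination_coefficients` and the test functions are illustrative.

```python
from itertools import combinations, product
from math import prod


def determination_coefficients(f, n, m=100):
    """Return [R_1^2(f), ..., R_n^2(f)] estimated on an m^n midpoint grid."""
    pts = [(j + 0.5) / m for j in range(m)]
    subsets = [T for k in range(1, n + 1) for T in combinations(range(n), k)]
    s1 = s2 = 0.0
    sw = dict.fromkeys(subsets, 0.0)
    for x in product(pts, repeat=n):
        y = f(x)
        s1 += y
        s2 += y * y
        for T in subsets:
            sw[T] += y * prod(x[i] - 0.5 for i in T)
    count = m ** n
    var = s2 / count - (s1 / count) ** 2
    # r(f,T)^2 = 12^|T| * (integral of f * prod(x_i - 1/2))^2 / var(f)
    r_squared = {T: 12 ** len(T) * (sw[T] / count) ** 2 / var for T in subsets}
    return [
        sum(v for T, v in r_squared.items() if len(T) <= k) for k in range(1, n + 1)
    ]


print("minimum, n = 2:        ", determination_coefficients(min, 2))
print("arithmetic mean, n = 2:", determination_coefficients(lambda x: sum(x) / 2, 2))
```

For the arithmetic mean, which is itself a multilinear polynomial of degree 1, $R_1^2$ should be close to 1, whereas for the minimum of two variables the linear approximation leaves part of the variance unexplained.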

Acknowledgments 

The authors wish to thank Michel Beine, Miguel Couceiro, Paul Gerard, and 
Samuel Nicolay for fruitful discussions. This research is supported by the internal 
research project F1R-MTH-PUL-09MRDO of the University of Luxembourg. 

References 

[1] L. S. Aiken and S. G. West. Multiple Regression: Testing and Interpreting Interactions.
Newbury Park-London-New Delhi: Sage Publications, 1991.

[2] J.-P. Aubin. Cooperative fuzzy games. Math. Oper. Res., 6(1):1-13, 1981.

[3] J. F. Banzhaf. Weighted voting doesn't work: A mathematical analysis. Rutgers Law Review,
19:317-343, 1965.

[4] G. Beliakov, A. Pradera, and T. Calvo. Aggregation Functions: A Guide for Practitioners.
Studies in Fuzziness and Soft Computing. Springer, Berlin, 2007.

[5] M. Ben-Or and N. Linial. Collective coin flipping. In Randomness and Computation, pages
91-115. Academic Press, New York, 1990.

[6] J. Bourgain, J. Kahn, G. Kalai, Y. Katznelson, and N. Linial. The influence of variables in
product spaces. Isr. J. Math., 77(1-2):55-64, 1992.

[7] P. Dubey and L. S. Shapley. Mathematical properties of the Banzhaf power index. Math.
Oper. Res., 4:99-131, 1979.

[8] K. Fujimoto, I. Kojadinovic, and J.-L. Marichal. Axiomatic characterizations of probabilistic
and cardinal-probabilistic interaction indices. Games Econom. Behav., 55(1):72-99, 2006.

[9] M. Grabisch and C. Labreuche. How to improve acts: An alternative representation of the
importance of criteria in MCDM. Int. J. Uncertain. Fuzziness Knowl.-Based Syst., 9(2):145-157, 2001.

[10] M. Grabisch, J.-L. Marichal, R. Mesiar, and E. Pap. Aggregation functions, volume 127 of
Encyclopedia of Mathematics and its Applications. Cambridge University Press, Cambridge,
2009.

[11] M. Grabisch, J.-L. Marichal, and M. Roubens. Equivalent representations of set functions.
Math. Oper. Res., 25(2):157-178, 2000.

[12] M. Grabisch, T. Murofushi, and M. Sugeno, editors. Fuzzy measures and integrals - Theory
and applications, volume 40 of Studies in Fuzziness and Soft Computing. Physica-Verlag,
Heidelberg, 2000.

[13] M. Grabisch and M. Roubens. An axiomatic approach to the concept of interaction among
players in cooperative games. Int. J. Game Theory, 28(4):547-565, 1999.

[14] P. Hammer and R. Holzman. Approximations of pseudo-Boolean functions; applications to
game theory. Z. Oper. Res., 36(1):3-21, 1992.

[15] P. Hammer and S. Rudeanu. Boolean methods in operations research and related areas.
Berlin-Heidelberg-New York: Springer-Verlag, 1968.

[16] J. Kahn, G. Kalai, and N. Linial. The influence of variables on Boolean functions. In Proc.
29th Annual Symposium on Foundations of Computer Science, pages 68-80. Computer
Society Press, 1988.

[17] J.-L. Marichal. The influence of variables on pseudo-Boolean functions with applications to
game theory and multicriteria decision making. Discrete Appl. Math., 107(1-3):139-164, 2000.

[18] J.-L. Marichal, I. Kojadinovic, and K. Fujimoto. Axiomatic characterizations of generalized
values. Discrete Appl. Math., 155(1):26-43, 2007.

[19] J.-L. Marichal and P. Mathonet. Approximations of Lovász extensions and their induced
interaction index. Discrete Appl. Math., 156(1):11-24, 2008.

[20] R. B. Nelsen. An introduction to copulas. Springer Series in Statistics. Springer, New York,
second edition, 2006.

[21] L. Shapley. A value for n-person games. In Contributions to the Theory of Games II (Annals
of Mathematics Studies 28), pages 307-317. Princeton University Press, 1953.

Mathematics Research Unit, FSTC, University of Luxembourg, 6, rue Coudenhove- 
Kalergi, L-1359 Luxembourg, Grand Duchy of Luxembourg. 
E-mail address: jean-luc.marichal[at]uni.lu 

Mathematics Research Unit, FSTC, University of Luxembourg, 6, rue Coudenhove- 
Kalergi, L-1359 Luxembourg, Grand Duchy of Luxembourg. 
E-mail address: pierre.mathonet[at]uni.lu



